Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

The OCRopus Open Source OCR System

Identifieur interne : 000D02 ( Main/Exploration ); précédent : 000D01; suivant : 000D03

The OCRopus Open Source OCR System

Auteurs : Thomas M. Breuel [Allemagne]

Source :

RBID : Pascal:08-0399962

Descripteurs français

English descriptors

Abstract

OCRopus is a new, open source OCR system emphasizing modularity, easy extensibility, and reuse, aimed at both the research community and large scale commercial document conversions. This paper describes the current status of the system, its general architecture, as well as the major algorithms currently being used for layout analysis and text line recognition.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">The OCRopus Open Source OCR System</title>
<author>
<name sortKey="Breuel, Thomas M" sort="Breuel, Thomas M" uniqKey="Breuel T" first="Thomas M." last="Breuel">Thomas M. Breuel</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>DFKI and U. Kaiserslautern</s1>
<s2>Kaiserslautern</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="2">Rhénanie-Palatinat</region>
<settlement type="city">Kaiserslautern</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">08-0399962</idno>
<date when="2008">2008</date>
<idno type="stanalyst">PASCAL 08-0399962 INIST</idno>
<idno type="RBID">Pascal:08-0399962</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000271</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000513</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000224</idno>
<idno type="wicri:Area/Main/Merge">000D14</idno>
<idno type="wicri:Area/Main/Curation">000D02</idno>
<idno type="wicri:Area/Main/Exploration">000D02</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">The OCRopus Open Source OCR System</title>
<author>
<name sortKey="Breuel, Thomas M" sort="Breuel, Thomas M" uniqKey="Breuel T" first="Thomas M." last="Breuel">Thomas M. Breuel</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>DFKI and U. Kaiserslautern</s1>
<s2>Kaiserslautern</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="2">Rhénanie-Palatinat</region>
<settlement type="city">Kaiserslautern</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Proceedings electronic imaging science and technology</title>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Proceedings electronic imaging science and technology</title>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithm</term>
<term>Character recognition</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Algorithme</term>
<term>Reconnaissance optique caractère</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance forme</term>
<term>0130C</term>
<term>4230S</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">OCRopus is a new, open source OCR system emphasizing modularity, easy extensibility, and reuse, aimed at both the research community and large scale commercial document conversions. This paper describes the current status of the system, its general architecture, as well as the major algorithms currently being used for layout analysis and text line recognition.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Allemagne</li>
</country>
<region>
<li>Rhénanie-Palatinat</li>
</region>
<settlement>
<li>Kaiserslautern</li>
</settlement>
</list>
<tree>
<country name="Allemagne">
<region name="Rhénanie-Palatinat">
<name sortKey="Breuel, Thomas M" sort="Breuel, Thomas M" uniqKey="Breuel T" first="Thomas M." last="Breuel">Thomas M. Breuel</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000D02 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000D02 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:08-0399962
   |texte=   The OCRopus Open Source OCR System
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024